NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Fast multiplication of random dense matrices with sparse matrices

Liang, Tianyu; Murray, Riley; Buluc, Aydin; Demmel, James (May 2024, IEEE International Parallel & Distributed Processing Symposium 2024)

This work focuses on accelerating the multiplication of a dense random matrix with a (fixed) sparse matrix, which is frequently used in sketching algorithms. We develop a novel scheme that takes advantage of blocking and recomputation (on- the-fly random number generation) to accelerate this operation. The techniques we propose decrease memory movement, thereby increasing the algorithm’s parallel scalability in shared memory architectures. On the Intel Frontera architecture, our algorithm can achieve 2x speedups over libraries such as Eigen and Intel MKL on some examples. In addition, with 32 threads, we can obtain a parallel efficiency of up to approximately 45%. We also present a theoretical analysis for the memory movement lower bound of our algorithm, showing that under mild assumptions, it's possible to beat the data movement lower bound of general matrix-matrix multiply (GEMM) by a factor of sqrt(M), where $$M$$ is the cache size. Finally, we incorporate our sketching method into a randomized algorithm for overdetermined least squares with sparse data matrices. Our results are competitive with SuiteSparse for highly overdetermined problems; in some cases, we obtain a speedup of 10x over SuiteSparse.
more » « less
An O(N) distributed-memory parallel direct solver for planar integral equations

https://doi.org/10.1109/IPDPS57955.2024.00046

Liang, Tianyu; Chen, Chao; Martinsson, Per-Gunnar; Biros, George (May 2024, IEEE)

Full Text Available
RCHOL: Randomized Cholesky Factorization for Solving SDD Linear Systems

https://doi.org/10.1137/20M1380624

Chen, Chao; Liang, Tianyu; Biros, George (January 2021, SIAM Journal on Scientific Computing)

Full Text Available
Chemically Triggered Click and Declick Reactions: Application in Synthesis and Degradation of Thermosetting Plastics

https://doi.org/10.1021/acsmacrolett.1c00548

Wu, Tianhong; Liang, Tianyu; Hu, Wei; Du, Meiqing; Zhang, Sijia; Zhang, Yanfeng; Anslyn, Eric V.; Sun, Xiaolong (September 2021, ACS Macro Letters)

Full Text Available

Search for: All records